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Abstract 

A very simple example of an algorithmic problem solvable by dynamic programming is 
to maximize, over A C {1,2,..., n}, the objective function \A\ — J^i ^ A, i + 1 G A) 
for given > 0. This problem, with random (^i), provides a test example for studying 
the relationship between optimal and near-optimal solutions of combinatorial optimization 
problems. We show that, amongst solutions differing from the optimal solution in a small 
proportion S of places, we can find near-optimal solutions whose objective function value 
differs from the optimum by a factor of order 6^ but not smaller order. We conjecture 
this relationship holds widely in the context of dynamic programming over random data, 
and Monte Carlo simulations for the Kauffman-Levin NK model are consistent with the 
conjecture. This work is a technical contribution to a broad program initiated in Aldous- 
Percus (2003) of relating such scaling exponents to the algorithmic difficulty of optimization 
problems. 
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1 Introduction and Motivation 



1.1 Near-optimal solutions in combinatorial optimization 

Consider a combinatorial optimization problem which is "size n" in the sense that a feasible 
solution X = (xj, 1 < i < n) consists of n elements (e.g. edges of a graph; binary digits) subject 
to some constraints, and the objective function /(x) is akin to a sum over i of costs or rewards 
associated with each Xi. In such a setting one can define the relative distance between the 
structure of a feasible solution x and the optimal solution x* by 

5„(x) = : Xi / x*}\ 

and the relative difference in objective function is n^^\f(x) — /(x*)|. So the quantity 

e„(5) :=min{n-V(x)-/(x*)| : 5„(x) > 5} (1) 

measures how close we can get to the optimal value using feasible solutions which have non- 
negligibly different structure from the optimal solution. A program initiated in [3j is to study 
this quantity for combinatorial optimization problems over random data. In this setting £niS) 
becomes a random variable, but in many cases one expects that as n — > cxd there is a deterministic 
limit function £{6). Motivation for this program is a conjecture that (within some suitable class 
of problems) 

e{6) X 5" as (5 ^ 

for some scaling exponent a, whose value is robust under model details, and that for "algorithmi- 
cally easy" problems we have a = 2 (which of course mimics the behavior we expect by calculus 
for smooth functions f : MJ^ M.) whereas for "algorithmically hard" problems we have a > 2. 
Here is the previous evidence in support of this conjecture. 

(i) Traveling salesman problem and minimum matching problem [3\. In the random link 
(mean-field) model, a cavity method analysis (non-rigorous but generally regarded as accurate) 
enables one to compute e{5) numerically and to observe scaling exponent a = 3. In the random 
Euclidean model, Monte Carlo simulations suggest the same a = 3. 

(ii) Minimum spanning tree. Here we expect a = 2. This is proved in [2] for the d > 2 
dimensional random Euclidean model and also for a "disordered lattice" model. 

The purpose of this paper is to consider some problems which are algorithmically easy to 
solve via dynamic programming, and where we therefore expect a = 2. We first give a trivial but 
instructive case (section [L2]) and then describe a prototypical "interesting" case, the Kauffman- 
Levin NK model (section II. 3p . Here both a heuristic argument and simulations suggest a = 2, 
but we do not have a proof. Our main focus is on giving a complete analysis of a simple non- 
trivial model (section II. 4p where we are required to pick a subset A C [n] := {1,2,... ,n} of 
items with a reward of 1 per item picked and i.i.d. costs incurred if both items i and i + 1 
are picked. Theorem [2] establishes a = 2 for this specific model. In these dynamic programming 
examples and the minimum spanning tree example, the key structural property is that the near- 
optimal solutions attaining the minimum in ([T]) differ from the optimal solution via only "local 
changes", each local change affecting only a number of items which remains 0(1) as (5 — > 0. It is 
natural to speculate that this structural property corresponds quite generally to the a = 2 case. 
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Related work We do not know any other lines of research in theoretical computer science 
which are close to the topic of this paper. A recent survey of average-case complexity of NP 
problems is given in [7j. Interest in the average-case gap between optimal and second-optimal 
solutions arises in several contexts, e.g. [5]. Closer in spirit is the statistical physics of disordered 
systems, where for low temperatures the Gibbs distribution on configurations concentrates on 
near-minimal-cost configurations. In the context of random energy models (the precise analog 
of optimization over random data) , two random picks from the Gibbs distribution over the same 
random choice of energy are called replicas, and study of such replicas and their overlaps is 
a central theme of the replica method \13\ I15j . So that topic studies the structural difference 
between two typical near-optimal configurations, whereas we study the maximal (over all near- 
optimal configurations) structural difference from the optimal configuration. Our mathematical 
arguments are much less sophisticated than those in statistical physics, but there are some 
intriguing parallels, described briefly in section 15. 2i 

1.2 A trivial example 

Let {Xi,i > 1) be i.i.d. real-valued random variables with continuous density h(x) and KX < oo. 
For each n consider the problem of finding 



The maximum is obviously obtained by choosing A = {i : Xi > 1} and then as n — > oo 

n'^Mn E{Xi - 1)+ a.s. 
Fix < 5 < 1. It is also obvious that the subset A' that minimizes 

subject to \A' AA\>6n 

is the subset A' = A A D where D is the set of indices of the [(5n] smallest values of |Xj — 1| 
So as n — > oo 

rl+a{S) 

n-^{Mn- M'^) ^L^e{5) := \x - l\h[x) dx 



where a{6) is defined by 



l-a{6) 



rl+a{5) 

(5 = / h{x) dx. 

Jl~a{5) 



So by continuity of h{x), and assuming < /i(l) < oo, as (5 J, we have 

«('^)~24t); e{6) a\5)h{l) ^ (2) 
which is the desired "scaling exponent = 2" result. 
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Discussion, (i) This example illustrates a feature that arises in other examples, that proving 
a = 2 reduces to showing that the density of a certain measure at a certain point is finite and 
non-zero. In nontrivial examples the measure in question arises in the analysis of the problem 
rather than the statement of the problem: see Lemma [T9l below and Proposition 8 of [2]. 

(ii) In this example we could see the form of the best near-optimal solution by inspection, but a 
systematic method is to use Lagrange multipliers. In this example, introduce a parameter 9 > 
and consider for each n 



Ae := argmax (^(Xi -1) +e\AAA*\] 



where A* = {i : Xi > 1} is the optimal solution. By inspection the solution is 

Ag = {i: l-e<Xi<loil + 9< Xi}. 

Although now \A0 A A*\ is random, we can use the law of large numbers to obtain existence of 
the limits 



6(6) := lim n-^\A* A Ae\ = / 



h{x) dx 



e{e) := Jrn^^^M - 1) - ~ ^) I = / \x-l\dx. 



By the interpretation of Lagrange multipliers, this is an implicit function representation of e as 
a function of 5, and rederives the limit ([2]) above. 



1.3 The NK model 

The Kauffman-Levin NK model of random fitness landscape has attracted extensive literature 
in statistical physics [9l|T7]. For our version of the model we fix > 2. We seek to minimize, over 
binary sequences x = (xi, . . . ,xn), the objective function Hj\[{-k) = J2i=i^ Wi{xi,Xi+i, . . . , Xi+x), 
where the values {Wi{bo,bi, . . . jbx) : i > l,b € {0,1}^^^) are independent exponential(l) 
random variables. This is algorithmically easy via dynamic programming. Write for the 
minimizing sequence. By subadditivity there is an a.s. limit N~^Hn[x.'^) ck- For a general 
sequence y = write 

<5iv(y) = N-^\{l<i<N-K:{yi,...,y,+K)^{xf,...,xf+K))\ 
e7v(y) = N-\HN{y)-HN{-^'')) 

and then set 

eAr((5) = min{eAr(y) : 5N{y) > S}. (3) 
We expect existence of a deterministic limit 

e{6) = a.s.- lim eAr((5). 



4 



A heuristic analysis The purpose of this section is to give a heuristic argument for e{5) x 6'^. 
Given i and I > K + 1, consider the set of sequences y such that 

{yj,---,yj+K) = {xf,...,xf^j^)yj^[i + i,i + i] 

{yj,---,yj+K) + Vj G + + 

Over this set, let Di^i be the minimum of H]\f{y) — ffjv(x^) and let y(*'') be the minimizing 
sequence. The distribution of Di^i essentially depends only on /, not on i or N; write fi{0+) for 
its density at 0+. Let's assume 

l''MO+) = A<oo. (4) 

1>K+1 

It is intuitively clear how to choose a sequence y which minimizes eiv(y) for a given 6. Just 
fix a small 77 > 0, and create a sequence of "excursions" away from x^ as follows. For each 
pair (i, /) such that Di i < -ql, choose y to equal y^*'') on the sites [i + K + l,i + I]; set y = x^ 
elsewhere. See Figure 1. 

01100011010010111010001001101010101101111000101011010 x^ 
011000110110101110100010011010 10001110100 000101011010 y 

Figure 1. Excursions of lengths I = 3 and 11. Here K = 2. 

With this scheme, 5 will be the mean length of possible excursions starting from a given site, 
that is 

1>K+1 

And e is the mean increment of H]\f associated with possible excursions starting from a given 
site, that is 

Y (r///2)-7?//K0+). 

1>K+1 

In other words 5 ~ At], e ~ Arf /2, giving e ^ {2A)^^5^ which is the desired "scaling exponent 
= 2" result. 

Why should the assumption dH) be true? Well, for large / we expect central limit behavior: 
Di ~ Normal (/i/, o"^/) for some fj, > and < cr^ < 00. This in turn suggests that fi{0+) should 
decrease at least geometrically fast in /. 

Note that the optimizing y^ in will have (in the iV — > 00 limit) some distribution Ls of 

excursion lengths. The heuristic argument predicts that as 5 J, we have Ls —>■ L where the 
limit distribution has P(L = I) (x lfi{0+) and EL < 00. 

Simulations (Table 1) with K = 3 are consistent with both the predicted scaling exponent 
2 and the prediction of existence of a 5 | limit distribution L for excursion lengths. Making a 
rigorous proof seems difficult and so we turn to a simpler example. 
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-5 


0.0308 


10.9 


0.004 


0.0774 


2.00 


• 10" 


-4 


0.0334 


11.0 


0.008 


0.147 


7.69 


• 10- 


-4 


0.0354 


11.3 


0.016 


0.266 


2.75 


• 10- 


-3 


0.0388 


11.8 



Table 1. Monte Carlo simulations with K = 'i^N = 10,000; 1000 repeats. These are exact 
optimizations done by introducing a Lagrange multipHer 9 which penahzes matching {K + 1)- 
tuples. We find C3 = 0.3065. 



1.4 Main model and results 

Let (^i, i > 1) be i.i.d. copies of a strictly positive random variable ^, and write G{x) = P(^ < x). 
Define the benefit function 

fn{A)=(^A\-Y,^ii\{ieA,i + l(^A)^, AC {1,2,..., n} (5) 

where l(-B) = 1b denotes the indicator random variable associated with an event B. Intuitively, 
we choose a set A of items, getting reward 1 from each item chosen but paying cost if we 
choose both i and i + 1; we seek to maximize benefit = reward - cost. So we shall study 

M„:= max U{A). (6) 

AC{l,2,...,n} 

To simplify exposition we will assume 

G has bounded continuous density g with g{^) > (7) 

which implies 

< G{\) < 1 (8) 

though we suspect that Theorems [1] and [2] remain true under some much weaker non-degeneracy 
assumptions. See section \5A\ for further remarks. 

We will first prove the following: 

Theorem 1 There exists ^ < c < 1, such that almost surely and in L} , 

lim n^'^Mn = a. 



The constant c is given by the forthcoming formula i31\) . If ^ is an exponential random variable 
with parameter A > then 



We record the explicit value of c only in the exponential case, but one could use formula (|3T 
to obtain explicit values for other standard distributions. 
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We now formalize the setup in the introduction. The optimization problem ^ has a solution, 
a random subset A"^^ C {1,2, ...,n}, and Corollary [5] will show the solution is unique with 
probability — > 1 as n ^ cx). Define the random variable: 

SniS) := mm {n-\U{AT) - fn{B)) : |i? A^^'I > 6n} (9) 

where the minimum is over all subsets B C {!,■■■ ,n} such that the symmetric difference with 
is at least 6n. Our main result is the following: 



Theorem 2 e{5) := lim„Ee„(5) exists for all < 5 < 1, and 



limsupiJ ^e((5) < oo, (10) 
<5iO 

limM6-'^e(6) > 0. (11) 

510 



We now outline the key ideas in the proof, and the organization of the paper. 

• Dynamic programming over i.i.d. data is essentially just study of a related Markov chain 
(section 12. 2p , and in our model there are simple inclusion criteria for whether item i is in 
the optimal solution. The inclusion criterion involves two Markov chains (one looking left, 
one looking right) and the cost (Table 2 and Lemma [5]). 

• By considering the related infinite-time stationary Markov chain and using the same in- 
clusion criteria, we can define a random subset A"'^*' C Z interpretable as the solution of 
an infinite optimization problem (section 12. 3p . 

• The n — > cxD limit benefit in Theorem [T] is just the mean benefit per item using in the 
infinite problem (section 12. 4p . 

• Study of en{S) is an "optimization under constraint" problem, most naturally handled via 
introduction of a Lagrange multiplier 6. So the attaining the maximum in Q can 
be studied as above by introducing a more complicated Markov chain parametrized by 6 
(section l4.ip . finding the inclusion criteria (Table 3), formulating the parallel optimization 
under constraint problem, and observing that e{5) is representable via functions 6{6),£{9) 
defined in terms of the stationary distribution of the more complicated Markov chain 
(Proposition [T2]) . 

Without trying to write details, it seems intuitively clear that the methodology above could be 
implemented in more general dynamic programming models such as the NK model of section 
II. 3[ However, to complete the argument we need to analyze the ^ — > behavior of the functions 
6(9), e{9). Even in our simple model, we do not have any useful explicit expression for the needed 
stationary distribution, so we proceed via inequalities rather than using the exact formulas. 
For the upper bound ()10p we just identify a "local configuration" which can be replaced by a 
different local configuration at small extra cost (section [3]) . For the lower bound, we decompose 
the process into blocks by breaking at certain special configurations, and then get bounds on 
the chance that -6°'" differs from on a block and bounds on the mean decrease in benefit if 
it does differ (section 14. 5p . But these arguments rely on the particular combinatorial structure 
of our special model. It is not clear how readily they can be extended to general models. 
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2 Analysis of optimal solutions 



2.1 Non-uniqueness 

In the case n = 2, if > 1 then both {1} and {2} attain the maximum value 1 of the 
optimization problem ([6|): the optimizing set is not unique. Corollary S] shows that, provided 
some + ^j+i < 1 is less than 1, the optimizing set A"^^ is unique, and by assumption ([SD 
this proviso holds with probability — > 1 as n — > oo. After this section we generally ignore the 
possibility of non- uniqueness. 

We start with some terminology that will also be used later. For an integer interval [g, d] 
with d — g + 1 even, the two complementary alternating subsets Ai, A2 are as shown in Figure 2. 

g - (g + l) - - {d-l) - d 

Ai o— • — o...«— o — • 

A2 •— o — • . . . o — • — o 

Figure 2. Included items marked •, excluded items marked o. 

Lemma 3 Let n > 2. For almost all realizations of (^i, . . . ,(,n-i, the following are equivalent. 

(a) The subset maximizing ^ is not unique. 

(b) n is even and the only optimal solutions are the two complementary alternating subsets of 
[l,n]. 

(c) n is even and = n/2. 

Proof. Either of (b,c) implies (a), so it is enough to show (a) implies (b) and (c). Suppose 
distinct subsets Bi and B2 attain the maximum. Then a.s. the values of used in the optimal 
sum are identical, that is 

{i : (i, i + 1) C Bi} = {i: {i,i + l)c B2} := S, say. (12) 

First suppose S is empty. Then each of Bi and B2 has only isolated elements. But amongst 
such sets, the maximum of ([6]) is attained (for n odd) uniquely by the alternating subset giving 
Mn = (n + l)/2, or (for n even) only by the complementary alternating subsets. So S empty 
implies (b) and (c). For general S, take some i Bi A B2, and then take the maximal interval 
i € [g,d] C [l,n] which is disjoint from S. Repeating the argument above, the restrictions of Bi 
and B2 to [g,d] must be complementary alternating subsets. If [g,d] ^ [^,n] then either d+1 
or 5 — 1 is in 5 - say d+1 - and so d + 1 € -Bi fl -B2 • But exactly one of Bi , B2 contains d, 
contradicting (fT2]) . So [g,d] = [l,n] and so S is empty. ■ 

Corollary 4 If ii + ii+i < 1 for some 1 < i < n — 2 then a.s. A'l^* is unique. 

Proof. Fix i with + ^j+i < 1 and let B be the alternating subset of [l,ra] containing i and 
i + 2. Replacing B hy B \J {i + 1} increases the benefit by 1 — — ^j+i > 0, so S cannot be 
optimal, and the result follows from Lemma [3j ■ 
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2.2 Dynamic programming 

Finding the maximum value and the maximizing subset of ^ is algorithmically easy by dynamic 
programming, as fohows. Define 

Vjt^, = max (|A|-^e,lO-e Aj + lG^) I (13) 

W^^, = max l\A\-Y,CjMjeAJ + leA)] (14) 

which differ in that the former requires i ^ A and the latter requires i ^ A. The superscripts 
L here and R later indicate left and right. Note that in fact V^^, above and below do 
not depend on n, but the notation is useful to distinguish from the limit process later. 

From (jl3|14p we see V^-^ = 1, = and by induction over \ <i 

v;^,+i = i + max(K^,-e^,w^i;J 

W„Vi = max(K^„Vri;J 

the two terms in the max indicating the choice of using or not using element i. Then M„ = 
max(y^„, Wn^n) s-iid by examining which max term is used at each stage leading to M„ we can 
recover the optimizing subset 

We now describe an alternative, more useful way to obtain A^^. First, consider the evolution 
rule for the process 

■ :=V^- -W^- (15) 

n'l ' n,t " n,i X^-'J 



as i increases; the rule is 



X^,i+i = l + max(0,X^_,-^i)-max(0,X^_J 

= l + max(-X^_„-eOl(X^,, >0). (16) 

One can check by induction that < • < 1 and thus rewrite the recursion as 

X^_,+i = max(l-X^,„l-e^). 
For n fixed we define the right processes analogously 

n-l 



Vn^i = ^ max 1^1- J^eil(iG Aj + le^) (17) 

W„^, = .^^^max l\A\-J2^jHj^Aj + ^^A)], (18) 



with V^j^ = 1, VF^„ = 0. Observe that the evolution rule for the process 



^^n,i • * n,i " n,i \^^J 
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as i decreases does not depend on n. In fact, we have 



X. 



max(l 



n,i—l 



(20) 



The point is that we can determine the optimizing random set A"^^ in terms of the quantities 
above. Fix i and consider the quantities {X^i,V^^,Wj;;^^), ^j, {X^^_^_^,V^^^^,W^^_^_^) and drop 
subscripts. We have four choices of whether to include (marked as • in Table 2) or exclude 
(marked as o in Table 2) items i and z + 1 in the optimal set A"^^. For each choice, the 
table shows the absolute benefit of that choice, then the relative benefit (relative to the choice 
to exclude both items). For each i the optimal A"^^ will contain, in positions + 1), the 
combination with largest relative benefit, and the final column indicates the criteria for use of 
each combination. 



a + 1) 



absolute benefit relative benefit 



when used 



X^ 
X^ 




if ^ < mm{X^,X^) 
if X^ < mm{X^,^) 
if X^ 



< mm(X^,^) 
never. 



Table 2. Inclusion criteria for + 1 in 



(The case of non-uniqueness of Lemma [3l is the case where Xj" and X[^ alternate between 

and 1 throughout the interval [l,n], and where we have equalities X^ = X^^ < ^j. Outside 
this case, one of the three strict inequalities holds. We ignore the non-uniqueness possibility in 
the summary below.) 

We summarize the argument above as follows. 
Lemma 5 For each n define Xf^^, 1 < i < n and X^-, 1 < i < n by 

= 1; = ™(i - 1 - ^^)' 1 < ^ < n - 1 (21) 

= 1; = max(l - X^, 1 - ^,_i), 2 < i < n. (22) 

Then A"*" is the random subset of {1, 2, . . . , n} specified by: for each 1 < i < n — 1, 
if ii < min(X^_„ X^,+i) then ieA^, i + le 
if X^i+i < min(X^,„ Ci) then i G A^, i + l^ 
if X^i < min(X^,+i, then i^A^, i + le A^ ■ 

Let us emphasize two points: 

• whether or not i S depends only on the three r.v.s X^-,^i,X^^_^_^ 

• the only place where the value of n enters is as the boundary condition X^^ = 1. 

In the next section, we show how to define a corresponding stationary process ((X/', ^j, X^^j^), — oo < 

1 < oo). By applying the specification in Lemma [5] to this process, we will define a set A""^^ C Z 
which will be shown (Lemma [8]) to be the limit of A'^^. As a consequence, we will be able to 
derive the limit of Mn/n. 
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2.3 A stationary Markov chain and the infinite hmit problem 

The recursion (j2ip specifies a Markov chain on the continuous state space [0, 1] with transitions 

X —5- max(l — x,l — (23) 

where has distribution function G. Write F{x) = P{X^ < x) for a stationary distribution 
function for this chain. Then 

F{x) = P(max(l < x) 
= P(min(X^,0 > 1 -x) 

where for any distribution function F we write F{x) = 1 — F{x). Iterating this identity once 
gives 

F{x) = G(l - x) (1 - G{x)F{x)) 

and solving this equation gives 

Fix)= G{l-.)Gix) 
^ ' l-G{x)G{l-x) ^ ' 

The assumption ([7]) that G has a density implies that F has a density, so in what follows we do 
not need to distinguish carefully between weak and strict inequalities for random variables with 
these distributions. 

Now consider the infinite line graph, with vertices — oo < i < oo and with i.i.d. edge-costs 
on edge + 1) such that P(^o + Ci < 1) > 0) which is ensured by the condition G(l/2) < 1. 

Lemma 6 The recursion 

= max(l - , 1 - ii), -cx)< i < oo (25) 

defines uniquely a joint distribution for {{£^i,X^),—oo <i < oo) in which {X^) is the stationary 
Markov chain with transition kernel 123\) and stationary distribution \24^ . And 

Xt = ^{...,^i-2,^i-i) (26) 
for a certain function (p not depending on i. 

Proof. Having proved existence and uniqueness of the stationary distribution at ()24p . it only 
remains to prove the measurability property ()26p . Iterating ()25p once shows 

1 - e. < ^f+i < max(l - ii, (27) 

So outside the event {1 — < ^i-i} the value of X^^j^ depends only on (^i_i, (^j) and not on the 
value of Xf' . So inductively on Q > 1 there exists a measurable function (j)Q such that 

^1 = <Pq{^-2Q-i,^-2Q, • • • ,^o) outside n°=_Q {1 - 6g < 
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Now ([26]) follows because P (nO^„Q{l - 6<? < 6<?-i}) = i^iCo + 6 > 1))^^^ ^0. ■ 
If we define an "i decreasing" process by 

= ,/.(..., e^+2,^^+l,e^) (28) 

then (Xf^) satisfies the analogous recursion 

= max(l - X/^i, 1 - ^i), -oo < i < oo (29) 

and is distributed as the same stationary Markov chain. Hence we have a rigorous definition of 
a unique (in distribution) stationary process ((Xf ,^i,X/^_i), -oo < i < oo) satisfying (|25T29|l 
which we will call the triple process. Note that from ()26|28p 

for each i the three r.v.'s X/", X/^j^ are independent. (30) 

Lemma 7 Let (Xf'j^ijX^^), —oo < i < oo be the stationary triple process. Then there is a 
random subset A°''* of Z specified by: for each — oo < i < oo, 

if < min(X/', X/^i) then i G A""', i + 1 G A""' 
if X/^i < min(Xf, ^i) then i G A""', i + A""' 
if Xl < min(X/^i , ^i) then i ^ A""', i + 1 G A""'. 

Proof. We need only check that the definition of A°'^*' is consistent, in that the criterion for item 
i to be excluded should be the same whether we look at the pair (i, i + 1) or the pair {i — l,i). 
(Of course this is intuitively clear from the consistency in the finite setting of Lemma [U but let 
us give an algebraic verification anyway.) We need to check 

{Xt < min(X,^i,e.)} = {Xf < min(Xf„i,ei-i)}. 
Using the recursions (|29|25p for X/^ and Xj", we need to check 

{max(l - X^Li, 1 - < min(X/^i, ^i)} = {max(l - X/^^, 1 - (i) < min(X^_i, e»-i)}. 

But these are equal by applying the transformation n ^ 1 — n to the right side. ■ 

Because the rule defining A°'^^ is translation-invariant, the augmented triple process 

{{Xt,^i,X,1„l{i G A-*)), -oo < i < oo) 

is also stationary. The next lemma shows this process is the limit of the corresponding finite- 
n process. The mode of convergence can be viewed as a very elementary case of local weak 
convergence [4] of random graphical structures. In words, it asserts that relative to a random 
time-origin the finite processes approximate the limit process. 

Lemma 8 Let Un be uniform on {1, . . . , n}. As n ^ oo 

{iXn,Un+i,Cu„+i,X^^UMl, l{Un + i G A°^^')), -OO < i < oo) 

S ((Xf,e„X^i,l(iG^°n), -oo<i<oo) 
where the left side is defined arbitrarily for Un + i {1, . . . , n} and where convergence in distri- 
bution is with respect to the usual product topology on infinite sequence space. 
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Proof. Because the X's are bounded and the ^'s are i.i.d., the sequence of processes is tight in 
the product topology. Write 

{{xt,LA1i,Hi e i-*)), -oo < i < oo) 

for a subsequential weak hmit. Clearly (^j) = (^j). Because for each n the process (X^j,^,) 

satisfies recursion ([2T]) . the limit satisfies this recursion, and so by the "uniqueness of 

joint distribution" assertion of Lemma [6l {Xj",^i) = Applying the same argument to 

we deduce 

((X^„+i,ec/„+i,^^t/„+m)' -oo<i<oo) ^ ((Xf,ei,X/ti)' -oo<i<oo). 

For fixed Iq the event iq E A""^* is a function of the limit process, the function implied by 
Lemma [71 and by a standard fact ([6] Theorem 5.2) it is enough to check that this function is 
a.s. continuous with respect to the limit process. But this just requires that the probability 
of an equality between some two of X^^, should be zero, which follows from their 
independence (pOj) and existence of densities (j7|24p . ■ 



2.4 Proof of Theorem [T] 



Because 

n n—l 

Mn = Y, Hi e AT) - Y^mi G AT,i + 1 € AT) 

i=l i=l 

we can write 

n~^EMn = P{Un G AT) - E^uMUn G AT, C/n + 1 G AT)HUn + n) 

and then by Lemma [8] 

n"^EM„ ^ c := P(0 € A°'^') - E^ol(0 € A°'^\ 1 G A""'). 

Note that clearly c < 1; the other inequality c > 1/2 holds because the subset {1, 3, 5, . . .} is a 
feasible choice. 

We now exploit the method of bounded differences [10] in a very routine way. We observe 
that Mn = ran{ii, ■ ■ ■ ,(,n) for a certain function m„ with the property 

changing any one argument of nin^Zi, . . . , z^) changes the value of mn{-) by at most 
1 



This property holds because AT will never contain a pair + 1) for which > 1. And this 
property implies the well-known Azuma-Hoefi'ding inequality of the form (see e.g. |14j ) 

P(jM„ - median(M„)| > t) < 4exp(-|^). 

It is now routine to use this large deviation inequality to establish the a.s. and convergence 
of n~^Mn to c. 
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To evaluate c, abbreviate {X^ , ^q, X^) to {X^,^,X^) and use the Lemma [7] definition of 
^°p' to write 

P(0 G = 1 - F{X^ < mm{X^, 0) 

= 1 - i(l - P(C < min(X^,X^))) by symmetry 
= i + iP(e<min(X^,X-^)) 

and then 

c = i + ip(e < min(X^, X^)) - Eei(e < min(X^, X^)). (31) 

Recall that X^, ^ and X^ are independent and that X^ and X^ have common distribution 
F given in terms of G by ()24p . So (I3ip constitutes a formula for c in terms of the underlying 
distribution function G of 

We now evaluate c in the special case where ^ has the exponential(A) distribution: 

G{x) = e"^^, < X < oo, 
so that, from formula (|24p. we have: 

g-A{i-aO(;L - e-^'^) e^^-1 



We deduce 



1 - e-^ - 1 



P(^ < min(X^,X^)) = / Xe-^^'P^iX^ > u) du 

Jo 



(e^ - ir Jo 



X 



(e^ - 1)2 







A /e2^(l-e-^) ^ e^-1 



(e^ - 1)H A 
- 2Ae^ - 1 



(e^ - 1)2 ■ 

E^l(^ < min(X^,X^)) = [ uXe-^^'P^iX^ > u) du 

Jo 



(e^ - 1)2 
A 

(e^ - 1)2 



1 , .2 

ue'^" e^-e^" du 







1 ri 



Jo 



A ,^e2^(l-(l + A)e-^) 1 + (A - l)e^ 



(e^ - 1)2 V A2 ' A2 

1 ^e2A_(^2^2)eVl 



A(eA - 1) 
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Combining, 



1 f(e2^-2Ae^-l)-(e2^-(A2 + 2)e^ + l) 

2 ^ A(e^ - 1)2 

1 (| - l)e2^ + 2e^ - I - 1 

2 ^ A(e^ - 1)2 

1 |(e2^-l)-(e^-l)2 

2 ^ A(e^ - 1)2 

1 + 1 1 

2 ^ 2(e^ - 1) ~ A 

1 1 
1-e-^ ~ A' 



3 The upper bound in Theorem [2] 



Local weak convergence (Lemma [8] above and Lemma [TT] below) provides one sense in which 
the 71 ^ oo limit of the solution ^4°^' of the size-n optimization problem is A logically 

different sense is provided by coupling, as follows. Part of the stationary triple process is the 
doubly-infinite i.i.d. sequence (. . . , ^o, ^i, ^2, • • •)• each n use these same r.v.'s ^i, . . . , 
to construct A"^^. Because of boundary effects it is not always true that A""^^ n [1, n] = A"^^. But 
we expect the sets to coincide "away from the boundary", and Lemma [9l^b) below provides one 
expression of this equality. We call this technique localization. 



3.1 Optimality properties of ^4°^* 

Lemma[7]gave a concise definition of A""^^ but did not explicitly identify its optimality properties. 
Lemma [9] below will relate A°'^^ to certain finite optima and thereby allow us to deduce some 
explicit properties. 

The benefit function fniA) and its maximum value Af„ defined at (j5l6p refer to subsets of 
[l,n], and it is convenient to make the corresponding definitions for an arbitrary interval m]: 

m—l 

f[^,m]{A) := 1^1- J^^a(iGA^ + lG^), AC{e,e+l,...,m} (32) 

i=e 

%,m] := , max f[e-m]iA) (33) 
^ Ac{e,e+i,...,m} ^ ' 

and denote by the corresponding optimizing set. 

Lemma 9 (a) //Ci-i + < 1 then i € A""'. 

(b) If i < m and + < 1 and ^m-i + Cm < 1 then A°^*^^ is unique and 

A"-'n[i,m]=Al^^^y (34) 

If furthermore [i, m] C [1, n] then A!^^ n [i, m] = (interpreting Co = if £ = 1). 

(c) If both i, i + 1 e then Ci < 1. 
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(d) If + > 1 then i,i + 1 and i + 2 together cannot belong to A°'^^. 

(e) Let k >2. If [g,g + 2k — 1] is an interval such that > ^g+i > . . . > S,g+2k-i > S,g+2k cLnd 

ej+0+i>l, 9<j<9 + 2k-2 

then A°^^ n[g,g + 2k — 1] must he one of the two complementary alternating sequences in [g, g + 
2k 

Proof, (a) If ^i-i + < 1 then > 1 — ^i-i > and hence from the Lemma [7] definition we 
see that for any possible value of Xf^j^^, we have i G 

(b) First note that both H. and m are in ^jj^p for otherwise adding each element would 
increase /[£,m](^[£'m]) by at least 1 — ^£ and 1 — ^m-i respectively. Next note that by rewrit- 
ing Lemma [5] (which concerns the special case [£, m] = for general m], we have a 
construction of ^["^^j in terms of processes X^^^-^. and X^^^t^^ for i < i < m defined by 
the recursions analogous to (j21l22p . By (a) both i and m are in We have now shown 
-^[i,m]/ ^ ^ ~ = ^£ X^^^-^^^_^ = 1 - (,m-i = -'^m-i; becausc the restricted and unre- 
stricted processes have the same boundary conditions and satisfy the same recursions over [I, m] 
they must agree throughout the interval. Finally, because both endpoints I and m are in ^jj^^^j 
it cannot fit the "complementary alternating sequences" criteria for non- uniqueness (Lemma [3]). 
The same argument works for A"^^. 

One could prove (c,d,e) algebraically from the definition of A°'^^, but it is more intuitive to 
exploit the finite optimality criterion as follows. From assumption ([8]) there are infinitely many 
i with + < 1, and so for any given i there is a (random) long interval [i,m] containing 
i for which by (b) A°'^^ n [i, m] = ^°^^] • In other words, the restriction of A°'^^ to [£, m] is the 
solution of the finite optimization problem ()32p . and we can derive its properties by considering 
the effect of local changes. Now (c) and (d) follow from the observations 

(for (c)): if i, z -|- 1 are in A°^^ then removing i + 1 will give a relative benefit of at least — 1. 
(for (d)): if i,i+l,i+2 are all in A"^' then removing i+1 will give arelative benefit of ^j+i^j+i — 1. 

For (e), consider j € [g,g + 2k]. By (d) we cannot have {j,j -|- l,j -|- 2} C If j and 

j + 1 but not j + 2 are in A°'^^ then deleting j + 1 while adding j + 2 would increase the benefit 
by at least — > 0, which is impossible. It follows that we cannot have -|- 1} C A°'^^. 
Thus A°'^^ r)[g,g + 2k — 1] contains only isolated elements. It is now easy to check that one can 
change ri[g,g + 2k — 1] into one of the alternating sequences on [g,g + 2k — 1] in such a 
way that the cardinality does not decrease, and the end items g,g + 2k — 1 change (if at all) 
only from included to excluded. Thus the change can only increase the benefit; appealing to the 
uniqueness property (b) in a larger interval establishes (e). ■ 

3.2 Proof of upper bound 

In this section we prove the bound 

limsupf^^^ limsupEe„(5) < oo (35) 

<5iO n 
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via a simple construction of near-optimal sets. We first describe a particular configuration. Let 
g,d G Z such that d — g = 2k for some k >2, and consider the sets A and B below 

g - (g + l) - (g + 2k-l) - g + 2k 

A * — o — • — o...»— o — • 

B • — • — o — •...o— • — • 

where |A A i3| =2k — \ and the difference between the benefits of A and B is: 

f[g,g+2k\{A) - f[g,g+2k\{B) = {k + I) - {{k + 2) - - ig+2k-l) 

= ig+ ig+2k~l - 1- (36) 

Now fix /c > 2 and a > such that ak < 1/2. Consider the event i}g defined by: 

^9 > '^9+1 > ^9+2 > • • • > ig+2k-2 > (,g+2k-l > f > (,g+2k; 

S.g-1 + < 1; S.g+2k-l + (,g+2k < 1- 

By assumption ([7]) this event has non-zero probability. If this event occurs, Lemma [9K a) shows 
that contains g and g + 2k, and then Lemma O^e) implies that A°^* H [5, 5 + 2k] is the set A 
above. By applying Lemma [9)[|b) to [£, m] = [g,g + 2k] we have the analog in the finite n setting: 
if Qg occurs for [g, g + 2k] C [1, n] then A"^^ ri[g,g + 2k] is the set A above. So if we change A"^^ 
by replacing pattern A by pattern B on such an interval, then from (|36p the decrease in benefit 
equals + ^g+2fc-i — 1 > 0. Now define 

=ngn{l<Cg + ig+2k-l < 1 + 2ka} 

g(a)=P(17(-)) 

ria) = EiCg + Cg+2k-i-l)mi''^) 

so that r(a) is the unconditional mean increase in benefit from the possible change, now per- 
)(") 



formed only if event Qg happens. Using assumption ([7]) we see that (^g + Cg+2k-i) restricted 

1, 

a) ~ qa, r{a) ~ fa^ as a | (37) 



(a) 

to ^}g has a continuous density which is non-zero at 1, which easily implies that for fixed k 



for constants r € (0, 00) 

Giv 
and let 



Given n and the optimal set A'^^, construct a near-optimal set B^^ as follows. Let gi = 1 



[gi,gi + 2k], [g2,g2 + 2k], [g^, g?, + 2k], . . . ,[gj„, gj^ + 2k] 

be the adjacent disjoint intervals in [l,n] containing 2k + 1 integers. For each such g = gj, if 
event Og"^ occurs, then on [g,g + 2k] replace pattern A by pattern B. 

Letting n ^ 00 and using the weak law of large numbers, we get 

i l^i") A AT \ 2kq{a)/{2k + 1) in probability, 

lUn{AT) - UB^n^)) ^ r{a)/{2k + 1) in probability. 
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If ^I-Ba"'' a < kq{a)/{2k + 1) then redefine Bli"' to be the empty set. Then (taking A; = 3 
for concreteness) 

^\Bt^AAT\ > 3(z(a)/7 
nmi(E/„(Ar)-E/„(S("))) = r(a)/7. 

n " 

The upper bound ([35ll now follows from the a —>■ asymptotics (|37|) . 



4 Proof of Theorem [2] : the lower bound 



4.1 Analysis of near-optimal solutions: the quintuple process 

Throughout section H] we fix a constant r > such that 

G(i-T)>0. (38) 

Such a constant exists by assumption ([7]). To study near-optimal solutions, fix a Lagrange 
multiplier 9 such that 

O<0 <T. (39) 

We will derive the existence of, and derive an exact expression for, the function £{6) = lim„ Een('^) 
when 6 is sufficiently small. The expression is an implicit function representation £{5{6)) = £{6) 
via two functions e{6),5{9) defined (I49|50p in terms of the stationary distribution of a certain 
quintuple process. 

We study the modified optimization problem in which we get an extra reward 6 for choosing 
an item which is not in A"^^ or for not choosing an item which is in A^^: 

max I 1^1 - ^Cil(i eA,i + le ^) + 6l[A A | . (40) 



i=l 



To study this we modify (I13|14p to 

V^^i= max I |AI-^ejl(j,i + le^) + ei(AAAr)n{l,2,...,i}| I (41) 

iGAC{1,2,...,i} y ^ 

W^^,= max ^ I \A\-yj,l{j,j + l€A) + e\{AAAT)n{l,2,...,i}\ \ . (42) 



rax \A\ - Y^^jHjJ + 'i- ^ A) + 9\{A A AT) n {1,2, ... ,i}\ 



We also define M„ = max(y^„, W^^) and write B°^^ for the corresponding optimizing set. Note 
that these quantities depend on 9. Analogous to the definition of X^- we define 

■ ■= V^- - W^- 

Then as the analog of (jl6p we can obtain the recursion 

= 1 - min(Z^,„ei)l«i > 0) + eJn,^+l 
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where 



Jn,i = i{i^AT)-HieAT). 

Recall from section [T3] the stationary triple process {{X^" , ^i, X^-^^), — oo < i < oo) and define 

Ji = l{i ^ A""') - l(i e A""'). 

Just as the stationary triple process is interpretable (Lemma[8|) as an n — > cxo limit of the process 
{X^^,^i,X^-^^), we want to define a process which wiU be the limit of {Zf^-,X^^-,^i,X^^-^^). 
So define a quadruple process (Z/", X/', ^j, X/^j^) to be a process such that 

(i) {X[ , ^i, X^-^) evolves as the triple process 

(ii) satisfies the recursion 

Zt+i = 1 - min(Zf , ^i)!!^/- > 0) + eJi+i. (43) 

Recall 0<9 <T. 

Lemma 10 The quadruple process {{Z[" , Xf" , ^i, X^-^), —oo < i < oo) has a unique stationary 
distribution, for which 

= V'(...,e^-2,e^-l,e^,^^l) (44) 

for a certain function ip not depending on i. On the event {^j-i + ■^i < 1 — t}, we have 

X[_^, = 1 - , Zt+i = l-Ci + eJ^+i. (45) 

Proof. Recursion (jl3|) implies Zj"_^-^ > 1 — + 6Ji+i- Thus iterating once ([13|) and using this 
last inequality, we obtain: 

1 - + ej,+i < zt+i < 1 - min(i - Ci-i + ej„c,)i{i - + 9Ji > 0) + ej,+i. 

Thus, on the event + £,i < I — 9} we have Z^^^^ = 1 — + 9JiJ^i and also, by (f27|l . we 

have Xj^^ = I — Cii establishing (P5]) . Assumption ([7]) implies that the event {Ci-i < 1 — t"} 
occurs for infinitely many i < 0, so in particular K := max{i < : + < 1 — r} is finite. 
By the recursion (I43p we can write Zq in the form 

^0 = '^^{Ck+i,(,k+2, ■ ■ ■ Zk+i, Jk+2, Jk+3, ■ ■ ■ ,Jo) 
for some function Tp^. Then by (j45p with Zj" = Zj^j^-^ we can rewrite as 

Zq = '^'^iCK,£,K+i, £,K+2, ■ ■ ■ Jk+1, Jk+2, Jk+3, ■ ■ ■ , Jo)- 

By the definition of each Jj is a function of X^ ,S,i,X^^, and then from the recursions for 
X/^ and Xf 

■^0 = '4^^{iK,iK+l,iK+2, ■ ■ ■ ,io]X^+l,Xi). 
By (05]) with Xf- = X^^^ this is of the form 

Zo^ = ^(...,e-2,e-i,Co,^f) 
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Now dm defines a stationary version of the quadruple process. ■ 

Just as X^^ was the "looking right" analog of the "looking left" process X;^-, we can define 
a "looking right" process Z^- analogous to Z^- as follows. Define 

V^i= max I Aj + lGA) + ^|(^A^°^*)n{i,i + l,...,n}| I (46) 

i&AC{i,i+l,...,n} \ ^ — ^ / 

V / 

W^,i = ^^^^ffax (^A\ - e A,j + leA)+9\{AA ^-*) n + 1, . . . ,n}|j .(47) 

Then the difference Z^^ = V^^ — W^^ satisfies the recursion 

Z« = 1 - min(Z^,+i, e,)l(^5+i > 0) + dJn,C ^^n = 1 + OJn,n- 

Recall that 5°^' attains maxAc{i,...,n} (l^l - Er=i^ Cil(i G ^, i + 1 G ^) + e\A A . As in 

section 12. 2| we can write down the benefits of each of the four possible choices for includ- 
ing/excluding items i and i + 1, and thereby obtain criteria for which combination is used in 
See Table 3, in which {Z^^,^i, Z^,-^j^) is abbreviated to {Z^,^, Z^) and the n subscript is 
dropped. 



(i + 1)— absolute benefit relative benefit when used 

v^^ + \/« - e + ^(%Aopt + i.+i^Aopt) z^ + z'^-c i<Ynm{z^ -ej,,z^-e,h+i) 

Vj^ + 1^-" + 6l(l,^^opt + l,+igA°pO Z^^ej^ {Z^ -9J,+i)+ <mm{Z^ -9J, 

W"^ + V^' + 0{l,^Aopt + h+i^Aopt) Z^ -ej,+i {Z^ -ej^)+ <mm{Z^ -0J,+i 

4- + e'(ljg^opt + li+ig^opt) otherwise 



— • o — 

— o • — 

— o o — 
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Table 3. Inclusion criteria for i,i + 1 in i?° 

It should now be clear that the stationary quadruple process can be extended to a stationary 
quintuple process 

(Z/',X/',ei,X,^i,^^i),-oo < i < oo 
in which Z^ satisfies the recursion 

Z^ = l- min(Z/^i,ei)l(Z/^i > 0) + eJi, -oo < i < oo 

satisfied by Z!^^^. By "reflection symmetry" between Z^ and Z^ , the functional relationship (j44p 
holds for Z^ in reflected form with the same function ip: 

= V(...,em,e*,e*-i,^f-i). (48) 

We can now use the stationary quintuple process to define a random subset 5°^' C Z by specifing 
that, for each pair (z,i + 1), we use the one of the four choices which has the largest relative 
benefit in Table 3. Analogously to Lemma [7] one can check this definition is consistent. The 
local weak convergence property (Lemma [8|) extends to the present setting as follows. 
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Lemma 11 Let Un he uniform on {1, . . . , n}. As n ^ oo 

((Z^,^„+„X^,^„+„ef/„+.,<^„+i+i,^^t;„+i+i,l(t/n+i G KnMUn+i G BT)), -oo < z < oo) 
^ ((Zf,Xf,e^,X,^i,Z^„l(ie^-'),l(^Gi?-')), -oo<i<oo). 

Proof. The proof repeats the proof of Lemma[8l using (|44|48p to incorporate the {Z^ , Z^) terms. 
In order to incorporate the 3°^^^ component, we need to check that the function 1(0 E 5°^') 
is a.s. continuous with respect to the stationary distribution of {Zq,Xq,^o,X^,Z(^). From 
Table 3, we get that {0 G 5°^*} = {Z^ - OJq > min(Co, max(Zf - 9Ji,0)}. Hence, it requires 
that the probabihty of an equahty between some of two Zq — 9Jq,^q,Z^ — 6Ji is zero. We 
only check that ^{Zq — 9Jq = ^o) = 0. The recursion satisfied by Zq reads Zq — 6Jq = 
1 — mm{Z^i,^-i)l{Z^i > 0). Thus, arguing as in the proof of Lemma [TOl Zq — 6Jq is a 
function of {Zj^_^^,(,K+i, ■ ■ • Jk+i, Jk+2, • • • , ^-i) with K = max{i < : Ci-i+Ci < 1-t}. 
Since Z^^^ = 1 — C_fc + 9 Jk+i and Jj G {—1, 1}, we deduce by recursion that there exists a pair 
of integers (zq, n) with K <iQ < —1 and —K < n < K such that Zq G {1 — + n9, + n9}. 
The independence of and for i < and assumption ([7]) imply that P{Zq — 9Jq = ^q) = 0. 
■ 

Now define 

6{9) = P({0 G A {0 G 3°"'}) (49) 

e{9) = P(0 G A°'"') - E^ol(0 G A°'^\ 1 G A°'^') - P(0 G B°''') + E^ol(0 G 5°"*, 1 G S°p')(50) 

So 5(0) is the proportion of items at which A°'^^ and 5°''* differ, and £{9) is the difference in 
mean benefit per item between ^4°^' and B°'^^ . By Lemma [TT| 

-E|^°p' A = E|l(C/„ G <•'') - l{Un G S°P')| 

n 

^ P({0 G A {0 G B°P'}) = (5(6*) (51) 
and similarly the mean benefits satisfy 

n-\^fn{AT)) - EfniBT)) - e{9). (52) 



Proposition 12 Let Mn = fn{B^''*) be the benefit associated to B^''\ then a.s. and in L^ 

lim u-^IB;^"' a A°^^'\ = 5{9) (53) 
lim n-^{Mn - Mn) = e{9). (54) 

n— >oo 

Moreover for any choice B'^ satisfying 153)) in L^ , the associated benefit M'^ = fn{B'^) satisfies 

liminf n"^E(M„ - M^) > £{9). 
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Proof. The convergence assertions (j53l54|) follow from (j51l52|) and the same concentration 
argument used in the proof of Theorem [U we will not repeat the details. By construction, for 
any B'^ the associated reward satisfies 

m; + e\B'„ A AT\ <Mn + e\BT A A°^'\. 

Then because both [B^^) and [B'^) satisfy ([53]) . we see that 

em;; <EM„ + o(n). 

■ 

Discussion For Q < 6 < t and for 6 = 6{9), Proposition 1121 implies that the limit e{6) = 
lim„ Ee„ (6) exists and that 

e{6ie))=e{e). 
So to prove Theorem [2] it should be enough to prove 

6{e) ~ a9, e{9) (39^ as 6 ^ (55) 

for positive constants a, (3. Now the definitions (|49|50p enable us to rewrite (using Table 3) 6{9) 
and £{9) in terms of the stationary distribution {Zq ,Xq ,^_q,X^, Z^) of the quintuple process, 
as 

5i9) = P({Xo^>min(Xf,eo)}A{Z(f-0Jo>min((Zf -0Ji)+,eo)}) (56) 
£{9) = F{X^ > min(Xf , ^o)) - P{Z^ - 9 Jo > min((Zf - 9Ji)+, ^o)) 

- E^o (1(^0 < min(X(f , )) - 1(^0 < mm{Z^ - 9 Jo, - 9Ji)) . 

So if we had an explicit formula for the stationary distribution {Zq , Xq ,^o, ^i-, Zi-), then we 
could derive an explicit formula for 5{9) and e{9) and seek to prove ([55|) by calculus. But we do 
not have such an explicit formula - note the independence property (I30p of the triple process 
does not hold for the quintuple process - and we have not completely succeeded in that program. 
We can prove (see Appendix) the 6{9) ~ a9 part of (f55l) . though we only use the weaker upper 
bound, proved by a simpler argument in section 14.21 . To handle £{9) we show how to rewrite 
S{9) and e{9) in a different way (Proposition llSp that allows us to derive inequalities, which will 
establish the stated form of Theorem [2j 

4.2 Existence of the limit function £(6) 

There is a minor technical point we deal with first. We expect intuitively that the function 5{9) 
should be continuous monotone, but neither property is obvious. If there were small values of 6 
which were not of the form 6 = 6(9) for some 9, then we can't use Proposition [12] to establish 
existence of a limit e((5). Instead we outline an argument (reusing previous methods) to prove 
more abstractly (Lemma [13]) that the limit e{5) always exists. We could have started the proof 
of Theorem [2] this way, but we wanted to emphasize the Lagrange multiplier approach as more 
useful for calculation. 

Lemma 13 e{6) := limnEe„(5) exists, for each < 6 < I. 
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Note that is a priori non-decreasing in 5, and hence e(-) is non-decreasing. 

Outline proof. Fix < 5 < 1. Let Bn^ attain the minimum in the definition ([UJ of en((5). Set 
e* (5) = lim inf „ Ee„ {6) . There exists a subsequence (of the subsequence of n attaining the liminf) 
in which the local weak convergence (Lemma [8]) of A°^^ to A""^*- extends to joint convergence of 
Bi'^ to some hmit random set B^^\ The analogs of ([Ml [50|) with B^^^ in place of S"""' equal 6 
and e*((5). For arbitrary n, start with the restriction {B*, say) of 2?^'^^ to [l,n] and then show 
that by modifying i?* near the endpoints we can construct B^* satisfying A > 5n and 
E[n-i(/n(Ar) - fn{B:*))] ^ e-*(<5). . 

The following Lemma (to be proved in Section [4. 4p allows to complete the proof of Theorem 

El 

Lemma 14 There exist positive constants Ci, C2 such that, for all < < t, 

6(9) < Ci9 (57) 
e{9) > C29^. (58) 

We now finish the proof of Theorem El Recall that Proposition 1121 showed £{5{9)) = £{9), and 
that (Lemma I13p e(-) is a non-decreasing function. Using (I57p 

e{Ci9) > e{6{9)) = e{9) > C29^ 

and setting 6 = Ci9 gives e(5) > C2(5^/C^. This establishes the lower bound (fTTj) and completes 
the proof of Theorem El 

4.3 A cycle formula representation 

Lemma 15 + < I - t then i G A""* and i £ B""'. 

Proof. Suppose ii-i+ii < 1 — r. Lemma[9l^a) showed i G Recall that 5°^* maximizes (j^Oj) . 
Hi ^ A then the increase in the benefit at (j40p obtained by including i is at least 1 — — — 9, 
so by our standing assumption (I39p the increase is positive and so i € -B^"^*. Letting n ^ 00 and 
using Lemma [TT] gives the same conclusion for 2?°^'. ■ 

We next need a lemma (analogous to Lemma [9j^b)) giving conditions under which we can 
"localize" A°'^^ and -6°"^' by forcing them to coincide with the optimal sets A'^^ and B^^ for the 
optimization problem on [l,n] for suitable n, which we now write as t — 1. 

Lemma 16 Let t > 2. Suppose -|- < 1 — r for each of i = 0,1, t — l,t. Then: 
(a) A'P' and B""' contain {0,1, t - l,t}. 

(h) The restrictions of A°^* and B"''* to [l,t — 1] coincide with A'^'^-^ and B^'^-^. 

(c) For any B^ {1,2,..., t- 1}, either B = or ft^i{B) < ft-i{A°/^^). 

(d) In particular, either A°^^^ = B^^^ or ft-i{Al'^j^) > ft-i{Bl'^_^^). 

Proof, (a) follows from Lemma [T5l Observe that A°^^^ and B^^^ contain 1 and t — 1, because 
^1 < 1 — r and ,^t-2 < 1 — r. If we consider the solutions A°,^^^-^, B?^\ for some interval [l,m] 
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strictly containing [0,t], then they contain 1 and t — 1 by the argument for Lemma [T^ Thus by 
optimahty the restrictions of and to — 1] must coincide with A^^-^ and -B^^\- 

So (b) follows from weak convergence, Lemma [TTl And (c) follows from the uniqueness result, 
Lemma [3J ■ 

We start by quoting a standard form (cf. [5] Exercise 6.3.4) of Kac's identity for stationary 
processes. 

Lemma 17 Let (Hj, —oo < i < oo) be a stationary ergodic sequence on some state space, let 
P(Hi € l)) > and let be real-valued and integrable. For any to > 1, define T = to mm{i > 

2 : Hjip € D). Then 



E/i(Hi) = E 



T-1 
i=l 

We apply this to = (Zf , , ^i, ^i^i, ^^-2, ^/^i' ^i+i) ^nd to = 3 and 

D := {^-1 + Co < 1 - r, Co + ei < 1 - t} = {Hi G 5} (59) 
for suitable D, making the T in Lemma [T71 be 

r = 3 min{t > 2 : ^34-2 + Cst-i < 1 - r, ^st-i + 6* < 1 - r}. (60) 
Now definition dH]) says 5(6) = E/i(Ho) for 

/i(Ho) = 1({0 G A {0 G -B°P*}). 

So equals the cardinality of A 5°^* restricted to [1,T - 1]. On the event 

D, Lemma [16] identifies this restriction as A"^^^ A B^^^, so Kac's identity gives ()6ip below. 
Similarly, definition (|50p says e{6) = E/i(Ho) for 

/i(Ho) = 1(0 G A""') - Col(0 G 1 G ^°''*) - 1(0 G -B°P*) + Col(0 G B""', 1 G S"""*) 

and on the event D the sum YlJ=i ^{^i) equals the difference fT-i{A°r^^_^) — fT-i{B'^^_^) between 
the benefits. This establishes ()62p . and the final assertion (|63p follows from Lemma [TBTd). To 
summarize: 



Proposition 18 Let D be the event ^59^) and let T he the random time \60\l. Then 

6{9) = E[1d X \A°^U ^ (61) 

8(9) = E[1d X ifT-iiA°^U) - fT-iiBr'-i))]- (62) 

On D, either A°^'_^ = B°^'_^ or fT-i{A"^U) " /t-i(^t-i) > 0- (63) 



4.4 An integration lemma 

Let us rewrite the difference in (I62p as 

W{e) := /t-i(At-i) - /T-i(i?T-i) 

to emphasize its dependence on 9; and note D does not depend on 9. The key ingredient in the 
proof of the lower bound is the following lemma, to be proved in section 14.51 
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Lemma 19 There exists C3 > such that for all < 9 < t, for all k > and x > 0, 
P(r > A:, < loWiO) <x)< C3x(k + l)P(r > k). 

Taking A; = in this lemma we get 

P{0 <1dW{9) <x) <C3X. (64) 

Recall a simple integration lemma (for a more general result see [2] Lemma 6(a)): 

Lemma 20 Let V >0 be a real-valued random variable such that 

F{0 <V < x) <Cx, 0<x <oo. 

Then 

By and Lemma [201 we get 

e{9) = E{1dW{9)) by §^ 

^ [P{W{9)1d>0)]^ _ ^g^^ 
~ 2C3 

To finish the proof of (jSSp . we need the following lemma 

Lemma 21 There exists a positive constant C4 such that, for all < 9 < t, 

P{W{9)1d>0)>C49. (66) 

Proof. By assumption ^ we may assume that the constant r at (j38p is such that 

inf g(x) > (67) 

1/2-2t<x<1/2t 

where g is the density function for ^j. Consider the event: 

ni9) = {e-i G (0,1/2), Co G (0,1/2 -r), 6 G (1/2- r, 1/2), 
6 G (1 - 6 - ^, 1 - ei), ^3 G (0, 1/2 - 2r)} . 

Using (f67ll there exists C4 > such that 

P{n{9)) > Ca9. 

Assume this event Q{9) happens. Then + ^0 < 1 — ''") Co + ^ 1 ~ ''"i 1 ~ ^ < + < 1 and 
+ ^ 1 ~ So D happens and using Lemma [9|^a) , we have {1, 2, 3} G and by Lemma 
[TBT b) the same holds true for A^^^. Still assuming Q{9) occurs, we see that for B = A'^^_^\{2}, 
we have fT~i{A°rf'_^) - /t-i(-B) = 1 - ^1 - ^2 G (0, 9) and therefore /t-i(5) + ^l^^-i A ^1 > 
/t-i(A5?1i), implying < W{9) by §3^. In particular 



P{W{9)1d > 0) > Pin{9)) > C49 
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and we have proved the assertion (|66p . ■ 

From (|65p and (|66p , we directly get the second assertion (j58p of Lemma [TH We now show 
how to obtain the first assertion of Lemma [T^ Recall that by definition, we have 

/t_i(S^^!i) + e\A°^U A i?T-il > /t-i(At-i), 

hence we get OT > e\A°rl^'_-^ A B°^l-^\ > W{e). In particular, by Proposition [HI we have D n 
{W{e) >Q] cDf^{^T > W{e) > 0}. Also by Lemma [H we have 

d{e) < E[TiDHw{e) > 0)] by dm) 

< ^iP(r>j,0i > >o) 

j 

< C30^/(i + i)P(r> j) 

j 

< C3eE[{T+l)% 

and r/3 has a geometric distribution so that assertion (j57p of Lemma [HI follows. 

4.5 Proof of Lemma 119! 

Write W = W{9). Consider the random collection 

B{T - I) := {B C {1,2, ... ,T - 1} : A°]:'_^, I £ B,T -I e B}. 
By Proposition [THl 

on D, either W = or W > ^ mm ^^{fT-i{A°^'_^) - /t-i(5)) > 0. (68) 

Our first goal is to derive a lower bound (Lemma I24p for the right-hand side of (|68p in terms of 
the ^j's. Until the end of the proof of Lemma [Ml we are working on the event D. 

Let C = argmin5gg(7-_i)(/T_i(yi;^^-i^) — fT-i{B)) be the optimal perturbation of A°'^^ on 
[1,T— 1]. For any subinterval X = [i,m\ C [1, T— 1] write = [max(^— 1, 1), min(m + 1, T— 1)]. 
Decompose A""^^ A C as UjXj where the Zj's are disjoint maximal intervals of A°'^*- A C. Then 

fT-l{A°n-fT-l{C) = n (X,)e) - /(X,).(C n (J,)e)) = (/t-i(^T-i) " /t~i(C^ 

i i 

where Cj = {A°^^_^ n I^'') U (C n (2i,)e). This implies that A C is a single subinterval I of 

[i,r-i]. 

We now look at the possible perturbations of A°^^ on the interval [0,T]. Recall that we are 
working on the event D, and that contains 0, 1, T — 1, T. Let Lq, Li, . . . , Lk be the maximal 
subintervals [a, b] C n [0, T] for which b > a, that is with at least two elements. So we can 
partition [0, T] as Lq U S'o U Li U S*! U . . . U Lk where the Sk are the complementary intervals. 
We call the L^, lakes and we call the Sk switches. 
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Lemma 22 Let L = [a, b] he a lake. For any set B G B{T - 1) such that 5 n = A°^' n L" 
and hence B n L ^ A°''* n L, we have 

fT-i{A"n-fT-i{B)>mm\l-U,l-^k-i, min 1 - - 1 > 0. (69) 

I a<i<b—l \ 

Proof. First suppose B is obtained by removing from A°'^^ a single item. If this item is a, we 
have fT-i{A°^') - fT-i{B) = I - ^a, if it is b, we have fT-i{A°^') - fT-i{B) = 1 - 6-i and 
if it is z G (a, b) then we have /t-i(^°''*) — /r-i(-B) = 1 — — ^i. So by optimahty of A°r^^_^ 
the first inequahty in ()69p holds for these -B, and Lemma [TBI implies the last inequality in (I69p . 
Now recall that Lemma[9|^c) shows min(j<j<f,_i 1 — .^j > 0. So construct a general B by removing 
items from A°'^^ one by one, and for items after the first the benefit can only decrease. So the 
first inequality holds generally. ■ 

Lemma 23 Let S = [a, 6] he a switch and 5e = [a — 1, 6 + 1] . For any set B such that BnS^ = 
A°p' nS^ and BnS ^ A""' n S, we have 

fSei^""*) - fSe{B) > min <^ min + - 1, min min - min - ^b+i, 1 - ^a_2 - 

\^a—l<.i<j<.b a—l<.i<b a<.i<.b a<i<.b 

Proof. By construction a switch starts and ends with items not in A°'^^, and the two items 
before and after the switch are in 74°"^'. Moreover, Table 2 shows that two adjacent items cannot 
both be not in so the items in a switch [a, h\ must strictly alternate between in and not-in 
74°"^', as illustrated in Figure 4. 

We first consider a set B obtained from by fiipping all items in some subinterval [u, v] 
of [a, b]. There are four cases, corresponding to whether the endpoints u, v are in or not-in ^4°^'. 
We exhibit these cases below, labeled as e.g. ["...o], together with the value of the benefit 
change fsM^n-fsAB). 



an V b 

^opt a — • — o ... • — o — • ... • — o — • ... o — • — • 
B • — • — o. — • — 0...0 — • — •...o — • — • 

Figure 4. Case [o . . . o], where a < u < v < b. Benefit change = ^u-i + ^t, — 1. 



a u V b 

^opt « _ a — o ... • — o — • ... • — o — • — o ... o 

C • — • — O. — • — 0...0 — • — O — 0...0 

Figure 5. Case [□...•], where a<u<v<b — 1. Benefit change = £,u-i- 
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a u 
D • — • — 0...0 — o 



V h 

— o — • ... • — o — • ... o 

— • — o ... o — • — • ... o 



Figure 6. Case [• . . . o], where a -\- 1 < u < v <b. Benefit change = 

In the fourth case [•...•], the benefit change equals 1. 

We also need to consider cases where the flipped subinterval [u, v] has u = a — loTv = b + \ 
or both. There are five cases, indicated in Table 4. 

[u,v] [a-l,o] [a-1,.] [o,6 + l] [•,6+1] [a -1,6+1] 

benefit change - 1 - ia~2 iu - ib+i 1 - ib+i 1 - ia-2 - Cfe+i 

Table 4. 

Now consider any subset B satisfying the hypothesis of Lemma [23j Decompose j4°p' A B 
into disjoint maximal intervals T,. It is easy to check that the benefit change between vl""^' and 
B is just the sum of the separate benefit changes between A°'^^ and A°'^^ with interval Xj flipped. 
Thus the minimum over B is attained by one of the cases we have considered, establishing the 
lemma. ■ 



Lemma 24 Set w = mini<j<j<j'_i{|^j + — l|;^i; |1 — — On the event D, either 

W = Q orW >w. 



Proof. We need only consider the case > 0. Recall that C = aigmm.B^B{T-i)[fT-i{-^T-i) ~ 
fT~i{B)) is such that A C is a single subinterval T of [1, T — 1]. It is enough to show that 
C satisfies the assumptions of Lemmas [22] (for some lake) or the assumptions of Lemma [23] (for 
some switch), for then the lower bound w follows from the lower bounds in those lemmas. 

We argue by contradiction: if false, then I intersects some lake and some adjacent switch, 
say Lk and Sk (the case of and Sk-i is similar). So there exist a < b < c such that b = supL^ 
and 2 = [a,c]. Now check 

if c G then f{B) - f{C) = 1 for S := C U {6, 6 + 2, 6 + 4, . . . , c}\{6 + 1, 6 + 3, . . . , c - 1} 

if c0^°p' then /(B) - /(C7) = for 5 := C U {6, 6 + 2, 6 + 4, . . . , c - 1}\{6+ 1, 6 + 3, . . . , c}. 
Either case contradicts the optimality of C. ■ 

We may now complete the proof of Lemma [19] The key point is that the bound w in Lemma 
[25] does not depend on 6. From Lemma [2^ 

P(r > 3/;:, < W{e)lD <x) < F{T > 3k, D;0 < w < x) < P{T >3k, w < x) 

< P I T > 3A;, min l^j + - ll < x ) + P ( T > 3k, min ^ < x] 

V l<i<i<T-l / V l<i<T-l J 

+P(r>3/c, min l^j - 1[ < x ) + P ( T > 3/c, min - Ci\ < x 

V l<i<T-l J \ l<i<j<T-l ■' 

The 4 terms on the right hand side are treated similarly: we will just study the final term, and 
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will prove that there exists C > independent of k such that, 



T>3k) < C{k + l)x. (70) 



P min \fi — f ,1 < X 

\i<i<i<T-i 



P min — f < x 

\^<i<j<T~l 

The effect of conditioning on the event {T > 3k} is that each non-overlapping triple (^sm, Csm+i, C3m+2) 
is conditioned to satisfy either {^m+Csm+i > l-T}U{^3m+i+C3m+2 > 1-t} or {Csm+Csm+i < 
1—7", Csm+i + C3m+2 < 1 — t} (for m = T). It follows that, for any i < j, 

p((e.,ei) G -IT > j) < a''pm,c,) e ■) (71) 

where a = min(P({eo + 6 > 1 - r} U {6 + 6 > 1 - r}), P(eo + 6 < l-T,ei+6 < 1-^)). 
From assumption ([7]) the density of — is bounded by some constant b, and so 

T>3k^ < ^P(|^i-^j| <x,r> j|r>3/c) 

= ^^il^i - < > max(j + 1,3A;))P(T >j + 1\T > 3k) 

i<j 

< ba'^x {j -l)P{T > j + 1\T >3k) 

i>3fe-i 

< ba-'^x 3{j + l)P(r > 3j\T > 3k) 

j>k 

= ba-^x Y 3(j + l)P(r > 3(j - k)) 

< ba-^x{kE[T] + E[T{T + 1)]), 

where we used the fact that T/3 has a geometric distribution. This concludes the proof of 
Lemma 1191 



5 Final remarks 

5.1 Technical assumptions on G 

We stated a single assumption ^ on G. What we actually used was three consequences of this 
assumption: 

• P(^ < 1/2) > 0, which implies P(^i + < 1) > 0, used in Lemma [15] and thereby 
throughout section |4] (because it implies i G A"^*) to implement "localization" arguments. 

• P(C < 1/2) < 1, used in section [32] to show Pi^g) > 0. Note that if P(^ < 1/2) = 1 then 
the optimization problem is degenerate in that the optimal ^"''^ = {1, 2, . . . , n}. 

• has density bounded below in some interval (1, 1 + ?/), used in section [3?2] to obtain 

iZD. 

The latter two are used only in a convenient way to exhibit one near-optimal set. The "localiza- 
tion" arguments essentially just require one to find some event of positive probability involving 
(^_fc, . . . , ^fc) which forces items and 1 to be in (or not in) Lemma \TE\ is just a simple way 
to exhibit such. So we expect Theorem [2] to remain true under much weaker assumptions on G. 
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5.2 Parallels with the cavity method 



The unsophisticated arguments in this paper in the context of i.i.d.-DP (dynamic programming) 
may be compared with the more sophisticated arguments from the statistical physics cavity 
method p^, as reformulated in more probabilistic language in [Tl[3], whose prototype example 
we take to be the analysis of TSP in the "mean-field" model of geometry where there are n 
points and each of the (2) inter-point links has random length. Of course algorithmically DP 
and TSP are quite different, but there are striking parallels between the analysis of optimal 
solutions of iid-DP and mean-field-TSP, as follows. 

• There are n — s- 00 limits for the random data; in DP this is just the obvious infinite i.i.d. 
sequence, while for mean-field-TSP it is a certain random infinite tree. 

• The "inclusion criterion" for iid-DP involves Xf^jX^^^ and the edge-cost ^j. Finite-n TSP 
has of course no simple inclusion criteria, but in the n — > 00 limit of mean-field-TSP there 
is an analogous criterion for inclusion of an edge in terms of quantities Zj^, and 
the edge-length ^jj. Each Z is interpreted (cf. (fT9]) for DP) as the difference between costs 
of two optimal solutions (subject to different local constraints) on one side of the tree. 

• The distribution we use for X in iid-DP, the stationary distribution of a Markov chain, is 

the solution of an equation with abstract structure X = h{^,X^). The distribution we 
use for Z in mean-field-TSP, by a recursion on the limit tree, is the solution of an equation 

with abstract structure Z = h{^; Z^ , Z^, Z^, . . .) where the Z^ are i.i.d. copies of the 
unknown distribution Z. 

These parallels provide a glimpse of how the analog of Theorem [H a formula for the asymptotic 
expected cost in mean-field-TSP, may be derived (the original non-rigorous argument was in 
; a rigorous proof was given only recently via more combinatorial methods [16] ) . The analog 
of Theorem [2] for mean-field-TSP, using Lagrange multipliers as in this paper, and leading to a 
non-rigorous argument that the scaling exponent equals 3, was given in [3j . 

Acknowledgement We thank Vlada Limic for discussions regarding the NK model. 
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A Exact asymptotic for the near-optimal solution 



In this section, we go one step further in the analysis of the quintuple process and prove the 
following refinement: 

Proposition 25 Under the assumptions of Theorem \^ 9 6{6) is differentiahle at and 
y(O) = a > 0. 

The proof is based on a variational analysis which may have an interest in its own. In view of 
Proposition [25l it is tempting to conjecture that there exists /3 > such that 

However, we do not have a rigorous proof of this claim. If (j72p was true then, from Proposition 
[T2| 5 ^ e{5) would be twice differentiahle at and e{5) = Pa~'^6'^ + o{S'^). We will exhibit 
formulae for the constants a and (3. In order to prove Proposition \25\ using (|56p we will write 
a first order expansion 6{-) at 0. We define 

St{B) = Zti9) - 9 J, - Xt and S^+,{9) = Zf^^{9) - 9J,+, - X^^,. 

In the next lemma, we give a first order estimate of Sq{9) and Si{9) as 9 goes to 0. Recall that 
the positive number r was defined by ([MI)- Finally let = inf{i > 2 : + < 1 - r}. 

Note that is independent of {(,o,Xi ). 

Lemma 26 There exist a constant C5 and an integer valued random variable independent 
of 9 such that, for all < 9 < t , 

\So{9)\ < 9K^ and E [K^1{S^{9) / 9Q^)\ < C^9, 

and respectively for Si{9) and = inf{i > 2 : + < 1 — r}. 

Proof. To simplify notation in the proof, we set K = and = S^{9). By Lemma [TOl 
X^j^_^_2 = 1 — S,-K+i and S^j^_^_2 = 0. Now, since the mapping x i-^- min(x,^)l(x > 0) is 
1-Lipschitz (i.e. contracting), we get: 

\St+i\ = \mmiZt,C,)l{Zt > 0) -min(Xf ,e.)l(^^'' > 0)| < \Zt - Xt\ < 9 + \St\. 

In particular: \Sq\ < 9{K — 2), and it concludes the first statement of the lemma. Now, using 
(j43p . we obtain the recursion 

= min(Xf , e^) - min((X/' + + 9Ji) + ,^i). 

Therefore, 

- if > then i G A°^' and Ji = -1. Moreover if Xf- > + 6* - Sf, then Sj^i = 0. 

- if X[ < and -S[ - 9Ji < < - S[ - 9Ji, then we have S[^^ = -Sf - 9Ji. 
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For each integer k, we define the event Ej^ as 

Ek = n^k<i<-i{\xt -Ci\> ke, xt > ke} . 

In particular, on the event Ek fl {S^/^ = 0}, we have for —k < i < —1, 



if Xt > 



-Sf' — 9Ji otherwise. 
We define U = inf{i > 1, X^^ < ^-i}. Since S^^_^2 — Oj the event Ek-2, 

= 1{U<K- 1)9 {-'^yJ^ = GQo- 

i=-U 

It remains to upper bound the probabihty of the event E^_2. Note that we have Xf;^^2 ■ 
1 — ^-K+i so that by the recrusion, we get for any k > 2, 

X^K+k € • • • , C-K+k-l} U {1 - S,-K+k-l}- 

This imphes the following inclusion: 

f] {^j ^[o,K9]u[i-Ke,i]} f] {\^j-^,\>Ke}n{\i-^j-^i\>Ke}cEK-2. 

-K+I<j<-1 -A'+I<j<j<-1 



Fk 

We have 

-1 



P(F^, K = k) < P{(j G [0, ke] U[l-k9,l], K = k) 

j=-k+i 

+ Yl P{\Cj-Ci\(^[o,k9]u[i-ke,i + ke], K = k) 

-fe+l<j<j<-l 

Given K = k, the variables {S,i,—k + 2 < i < —1) are conditioned on the event {min(^j + 
+ Ci+i) > 1 — t}. Similarly, the variable ^-t+i is conditioned on {^-fc+i + 6.-k < 
1 — r} n {^-fc+i + > 1 — 7"} while the variable is conditioned on {6,-k+i + ^-k < ^ — t}- 

Since we condition on events of positive probability and using the assumption that has a 
bounded density, we obtain, for some constant C, for all —k + 1 < j < i < —1, 

F{Cj €[0,ke]U[l-k6,l], K = k) < CkeP{K = k), 
Pi\£,j-£,i\£[0,k9]U[l-k9,l + ke], K = k) < Ck9P{K = k). 

Finally, we get, 

P{S^ / 9qL, K = k)< P(F^, K = k)< C'k^P{K = k)9. 

From (IMl), ^[K^] < oo, therefore E[K1{S^ / 9Ql^] < C'E[K'']9. m 

Now, as usual let {X^,^, X^) := {X^,^o, X^) and similarly, we drop the indices of S^,QI^, 5f , Qf. 

Let 

V = {(x^, z, x^) G : > min(z, max(0, x^))}, 



33 



and dV = V'^ H V is the boundary of V. Note that, from Tables 2 and 3, 

{0 G = {{X^,^, X^) G V}, 

{Og5°p'} = {{X^ + s^,i,x^ + S^) eV}. 

We define the functions 

F{e) = v{{x^,i,x^)£V,{x^ + s^{e),i,x^ + s^{e))^v), 

F{e) = V{{X^,i,X^)eV,{X^ + 9Q^,i,X^ + 6Q^)^V). 
The next lemma states that F and F have the same first order asymptotic as 6 goes to 0. 

Lemma 27 As 6 [ 0, 

\F{e)-m\ = o{e). 

Proof. For x G M'^ and r > 0, let r) denote the closed ball of radius r and center x. From 
Lemma [26| 

\F{e) - F{d)\ < P {e{Q^, Q^) / (5^, S\ B{{X^,C, X^), 9 max{K^, K^)) n aV / 0) . 
Using the "reflection symmetry" from "L" to "i?" we obtain: 

oo 

\F{e)-F{e)\ < 2J2f {OQ^ ^ S^;K^ = k;B{{X^,^,X^),emax{k,K^))ndV ^i/}) . 

k=2 

Since the boundary of V is smooth and the variables (X'^, ^, X^) are independent with bounded 
density, (see (I7l24p ). a simple calculation shows that there exists a constant c > such that for 
aU G [0, 1] and t > 0, 

P {B{{x^,^, X^),t) n 9V ^ 0) < ct. 
Now, fix an integer n, from the independence of X^, K^) and {Q^ , , X^ , K^), we get 

oo 

\F{9)-F{e)\ < 2"^F{dQ^ ^ S^;K^ = k;B{{X^,^,X^),emax{k,n))ndV ^i^i) 

k=2 

+ P {OQ^ + S^;K^ > n) 

oo 

< 2^c9 max{k, n)F {9Q^ / S^; = k) + C^eF{K^ > n) 

k=2 

< 2C5cn9^ + C59P{K^ > n). 

Taking n arbitrary large, we obtain our result. ■ 

We may now conclude the proof of Proposition [25l 
Proof of Proposition 1251 We have 

= J f3ix)lxev'E'xlx+eQfvdx 
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where x = {x^,z,x^), Q = {Q^,0,Q^), f3{x^,z,x^) = f{x^)g{z)f{x^) is the density of the 
triple process, and E(xi,z,x-R) [ ' ] = E[ • X^) = {x^ , z,x^)]. Let T? be the oriented or- 

thonormal vector to the surface dV at x, and dS the Lebesgue measure on dV. Since V'^ is the 
intersection of hyperplanes and Q is integer valued, calculus gives 

^'(0)= / /3(x)E^[max(T?.Q,0)]d5, 
JdV 

where it.Q is the usual scalar product of T? and Q. From Lemma [271 we get -P'(O) = i^'(O). 
Recall that (j56p implies that 

6{e) = F{d) + P((x^, X^) i V, (X^ + S^[B),i, + S^{0)) G V). 

The same computation on the second term gives 5 {9) = a6 + o{9) with 

a = I hix)E^\lt.Q\dS 
JdV 

= / / hix,z,x)E^^^,^^)\Q^ -Q^\dzdx+ / f3{x,x,x^)E^^^,^^)\Q^\dxdx^ 

Jo Jx Jo Jx 

and this concludes the proof of Proposition [25l ■ 

We now discuss the conjecture ([72]). Let ei{e) = P(0 G - P(0 G 5°^') = P(0 G 

- P(0 G we get 

ei(0) = F{{X^,tX^) eV,{X^ + S^,C,X^ + S^) ^V) 

-P((X^, X^) V, (X^ + 5^, e, X^ + S^) G V), 

and from what precedes, we deduce that e'i(O) = 0. Now, we define 

W = {{x^, z, x^) eR^ : z < min(x^, x^)}, 

so that, from Tables 2 and 3, 

{{0,1}CA°P*} = {(X^,e,X^) G W}, 

{{0,1} CB°^'} = {iX^ + S^,tX^ + S^)(£W}. 

Similarly, we introduce the function 

82(9) = E[^l{o,i}cA°ptl{0,l}!z:-B°pt] ~ E[^l{0,i}cBoptl{0,l}!z:A°pt] 

We might prove again that £2(0) = 0. Note that 

£{9)=ei{9)-e2{9). 

Let 

G{9) = E^1(X^, e, X^) G (X^ + S^{9),^, X^ + 5^(6/)) ^ W). 
If we had proved that F and G have a second derivative at 0, then we could obtain 

e{9) = (F"(0) - G"{0))e^ + o{9^) = P9^ + o(02)_ 

However the computation of the second derivatives of F and G involves some technicalities that 
we will not consider in this paper. 
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